# Low word error rate
Whisper Small Vi
MIT
An automatic speech recognition model fine-tuned on Vietnamese speech data based on openai/whisper-small, improving Vietnamese transcription accuracy and robustness
Speech Recognition
Transformers Other

W
namphungdn134
334
2
W2v Bert 2.0 Naijavoices Clearglobal Hausa 500hr V0
MIT
A Hausa speech recognition model fine-tuned from facebook/w2v-bert-2.0, trained on 500 hours of Hausa data with a word error rate of 7.47%
Speech Recognition
Transformers

W
asr-africa
16
1
Whisper Persian Turbooo
MIT
Persian automatic speech recognition model optimized based on OpenAI Whisper-large-v3-turbo, supporting medical field applications
Speech Recognition
Transformers Other

W
hackergeek98
51
2
Whisper Base Vi
MIT
A speech recognition model fine-tuned on 100 hours of Vietnamese speech data based on openai/whisper-base model, improving Vietnamese transcription accuracy
Speech Recognition
Transformers Other

W
namphungdn134
215
3
Whisper Large V3 Persian Common Voice 17
MIT
A Persian automatic speech recognition model fine-tuned based on Whisper Large v3, trained using the Common Voice 17 dataset, significantly improving Persian recognition accuracy.
Speech Recognition
Transformers Other

W
msghol
442
2
Whisper Large V3 Vaani Hindi
Apache-2.0
A Hindi speech recognition model fine-tuned based on OpenAI's Whisper-Large-V3, trained on approximately 718 hours of transcribed Hindi speech data
Speech Recognition
Safetensors
W
ARTPARK-IISc
15.55k
3
Indian Accent English Whisper Finetuned
MIT
Fine-tuned the openai/whisper-large-v3-turbo based on the Indian English accent dataset, which is more suitable for speech recognition of Indian English accents.
Speech Recognition
Transformers English

I
Tejveer12
1,733
1
Wav2vec2 Large Xlsr 53 Hungarian
Apache-2.0
An automatic speech recognition model fine-tuned on the Hungarian Common Voice dataset based on facebook/wav2vec2-large-xlsr-53
Speech Recognition
Transformers Other

W
sarpba
17
1
Whisper Small Fr
Apache-2.0
This is a Whisper-small speech recognition model fine-tuned on French datasets, reducing the word error rate by 6.793 percentage points compared to the baseline model.
Speech Recognition
Transformers French

W
mozilla-ai
30
1
Whisper Uz
Apache-2.0
Uzbek automatic speech recognition model fine-tuned from OpenAI Whisper Medium
Speech Recognition
Transformers Other

W
mustafoyev202
110
1
Kb Whisper Small
Apache-2.0
Whisper model released by the Swedish National Library, optimized for Swedish, trained on 50,000+ hours of Swedish speech data, outperforming the original OpenAI version
Speech Recognition
Transformers Other

K
KBLab
28.61k
3
Kb Whisper Medium
Apache-2.0
A Whisper model trained on over 50,000 hours of Swedish speech data released by the National Library of Sweden, excelling in Swedish speech recognition tasks
Speech Recognition
Transformers Other

K
KBLab
691
3
Kb Whisper Large
Apache-2.0
A Swedish speech recognition model based on the Whisper architecture released by the National Library of Sweden. The training data exceeds 50,000 hours, significantly reducing the word error rate.
Speech Recognition
Transformers Other

K
KBLab
8,880
42
Quran Whisper Base Fine Tune
Apache-2.0
This model is a fine-tuned Arabic speech recognition model based on openai/whisper-base on the quran-ayat-speech-to-text dataset, specializing in the task of converting Quranic verses from speech to text.
Speech Recognition
Transformers Arabic

Q
Baselhany
35
1
Whisper Large V3 Turbo STT Zeroth KO V2
A Korean automatic speech recognition model optimized based on Whisper Large v3 Turbo, providing high-accuracy transcription with timestamps
Speech Recognition
Transformers Korean

W
o0dimplz0o
662
3
Chunkformer Large Vie
A large-scale Vietnamese automatic speech recognition model based on the ChunkFormer architecture, fine-tuned on approximately 3000 hours of publicly available Vietnamese speech data, with excellent performance.
Speech Recognition
PyTorch Other
C
khanhld
1,765
12
Whisper Finetuned Amharic
Apache-2.0
Amharic speech recognition model fine-tuned from openai/whisper-small, achieving a word error rate of 2.0538% on the evaluation set
Speech Recognition
Transformers

W
seyyaw
57
1
Wav2vec2 Large Xls R 300m Ru
Apache-2.0
This model is a Russian automatic speech recognition (ASR) model fine-tuned on the common_voice_17_0 dataset based on facebook/wav2vec2-xls-r-300m, with a word error rate (WER) of 0.195.
Speech Recognition
Transformers

W
NLPVladimir
56
1
Whisper Small Tajik
Apache-2.0
A Tajik automatic speech recognition model fine-tuned from OpenAI Whisper-small, trained on Google Fleurs dataset with a word error rate of 24.26%.
Speech Recognition
Transformers Other

W
abduaziz
25
1
Whisper Small For Quran
Apache-2.0
A Quran speech recognition model fine-tuned based on OpenAI Whisper-small, specifically designed for Arabic Quran audio
Speech Recognition
Transformers Arabic

W
areaz
26
2
German RAG WHISPER LARGE V3 TURBO HESSIAN AI
MIT
German speech recognition model optimized based on Whisper Large v3 Turbo, fine-tuned on a 13-hour curated dataset, significantly improving German recognition accuracy
Speech Recognition
Transformers German

G
avemio
282
1
Whisper Uz
Apache-2.0
Uzbek speech recognition model fine-tuned on Whisper Base, trained on the Common Voice dataset
Speech Recognition
Transformers Other

W
jamshidahmadov
1,179
3
Whisper Khanacademy Large V3 Turbo Tr
MIT
An automatic speech recognition (ASR) model fine-tuned on Turkish Khan Academy dataset based on OpenAI Whisper-large-v3-turbo
Speech Recognition
Transformers Other

W
ysdede
31
1
Whisper Tiny German 1224
Apache-2.0
German speech recognition model optimized based on Whisper architecture, with 39 million parameters, supporting efficient German speech transcription
Speech Recognition
Transformers German

W
primeline
322
9
Whisper Large V3 Lv Late Cv19
Apache-2.0
A Latvian automatic speech recognition model fine-tuned based on whisper-large-v3, trained by AiLab.lv, supporting Latvian speech-to-text tasks.
Speech Recognition Other
W
AiLab-IMCS-UL
162
1
Whisper Base Hungarian V1
Hungarian speech recognition model fine-tuned based on OpenAI Whisper-base, trained on 1200 hours of Hungarian data, outperforming similar models
Speech Recognition
Transformers Other

W
sarpba
26
7
Whisper Large V3 Turbo Turkish
MIT
A Turkish speech recognition model fine-tuned on the Common Voice 17.0 dataset based on openai/whisper-large-v3-turbo
Speech Recognition
Transformers Other

W
selimc
289
6
Whisper Large V3 Turbo Es
MIT
Spanish speech recognition model fine-tuned based on Whisper-large-v3-turbo, achieving a word error rate reduction to 5.34% on the Common Voice 17.0 Spanish dataset
Speech Recognition
Transformers Spanish

W
adriszmar
52
4
Whisper Large V3 Turbo Arabic
Apache-2.0
Based on the transformers library, this is a fine-tuned version of openai/whisper-large-v3-turbo on the common_voice_11_0 dataset, optimized specifically for Arabic speech recognition.
Speech Recognition
Transformers

W
mboushaba
1,696
1
Finetuned Whisper Mr
Apache-2.0
A Whisper small speech recognition model fine-tuned on the Common Voice 17.0 Marathi dataset, based on simran14/mr-model-h
Speech Recognition
Transformers Other

F
simran14
38
1
Whisper Small Kurdish Sorani 10
Apache-2.0
A Kurdish Sorani dialect speech recognition model fine-tuned based on openai/whisper-small
Speech Recognition
Transformers

W
roshna-omer
95
1
Monsoon Whisper Medium Gigaspeech2
Apache-2.0
Monsoon-Whisper-Medium-GigaSpeech2 is a Thai automatic speech recognition (ASR) model, based on Whisper-Medium and fine-tuned on the GigaSpeech2 dataset, suitable for speech recognition in real-world scenarios.
Speech Recognition
Transformers

M
scb10x
546
5
Whisper Large V3 Az
Apache-2.0
This model is an automatic speech recognition (ASR) model fine-tuned on the Azerbaijani Common Voice 17.0 dataset based on OpenAI's Whisper Large v3, achieving a word error rate (WER) of 1.195%.
Speech Recognition
Transformers Other

W
nsalahaddinov
96
1
Whisper Large V3 Russian
A Russian speech recognition model fine-tuned based on OpenAI Whisper-large-v3, optimized for Russian recognition performance
Speech Recognition
Transformers Other

W
antony66
6,665
60
Whisper Large V3 Turkish Test1
Apache-2.0
A speech recognition model fine-tuned on the Common Voice 17.0 Turkish dataset based on OpenAI Whisper-large-v3
Speech Recognition
Transformers Other

W
erdiyalcin
21
3
Geez T5 Big 15k
A Ge'ez (Ethiopian) text processing model based on the T5 architecture, supporting text generation and transformation tasks
Large Language Model
Transformers

G
Samuael
38
1
Distil Large V3 Ct2
MIT
Distil-Whisper is a distilled version of the Whisper model, optimized for long-form transcription, offering faster inference speed and improved word error rate (WER) performance.
Speech Recognition English
D
distil-whisper
58
6
Whisper Native Elderly 9 Dutch
Apache-2.0
A speech recognition model fine-tuned on Dutch datasets based on OpenAI Whisper Large V2, with a word error rate of 10.14%
Speech Recognition
Transformers Other

W
golesheed
22
1
Whisper Large V3 Ft Cv16 Mn
Apache-2.0
A speech recognition model fine-tuned on the Common Voice 16.0 dataset based on OpenAI Whisper Large V3
Speech Recognition
Transformers

W
sanchit-gandhi
34
1
Wav2vec2 Xls R 300m Bp1 Es Eu
Apache-2.0
A fine-tuned Basque automatic speech recognition model based on facebook/wav2vec2-xls-r-300m, achieving a 3.67% word error rate on the Basque Parliament dataset
Speech Recognition
Transformers

W
gttsehu
49
1
- 1
- 2
- 3
- 4
- 5
- 6
- 8
Featured Recommended AI Models